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ABSTRACT 

Intended to supplement previously reprted conference 
proceedings on the preparation of special educators, the paper 
presents an overview of the structure of performance-based teacher 
education (PBTE) and certification, reviews research on selecting 
relevant program options, and identifies future research needs. 
Briefly covered are the following structural phases: selection, 
training, certification, and process and product criteria for 
evaluating work success. The author points to limited research bases 
for selecting one PBTE program option over another and recommends 
using the "General Catalog of Teaching Skills" as one approach for 
identifying skills and knowledge that should be incorporated into 
teacher education programs. (LH) 
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Richard L. Turner, Professor of Education and Associate Dean for 
Research and Development at Indiana University, Bloomington, 
presented the paper, Research Bases for Performance-Based 
Teacher Education Options at the CUNY-UFT Conference on 
CBTE at Tarrytown on March 2, 1974. The United Federation of 
Teachers and the City University of New York gave permission to 
reprint this copyrighted paper which represents substantially the 
material Dr. Turner presented to the Glenmont conference partici- 
pants on May 15, 1974. The editors of Design for Competence 
Based Education in Special Education are pleased to present this 
reprint as an integral attachment to the Proceedings from the New 
York State Conference on Competence Based Teacher Education 
and Special Education. 

This reprint is a product of Grant No. OKG-0-73-6140 funded by the United States 
Office of Education to the Board of Regents of the University of the State of New York 
and administered by the State Education Department, Division for Handicapped Children. 

The CBTK Conference was conducted at Glenmont, New York on May 14-16, 1974, in 
cooperation with the Division of Special Education and Rehabilitation of Syracuse 
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RESEARCH BASES FOR PERFORMANCE-BASED 
TEACHER EDUCATION OPTIONS 



In talking with you this afternoon I will be addressing two related topics. The 
first is an overview of the structure of teacher education, followed by a review of 
some of the research germane to various points in this structure. The outcome of 
this topic should be a clearer idea of the various options available in performance- 
based teacher education, and a notion ot how research findings bear on these 
options. The second topic picks up on a question left unresolved in the first 
topic ~ finding a rational way to select among all of the skills teachers might need 
to function effectively in a highly pluralistic society, in which various parent groups 
and their attendant children, various schools, and perhaps, various power blocks,** 
wish to see evidence of prescribed types of teacher behavior. In addressing the latter 
topic, I will refer principally to the approach we took to this quf^stion in construc- 
ting the General Catalog of Teaching Skills which we recently completed at Indiana 
University. 



The **structure'* of teacher education or a paradigm of it, which I will just 
outline now and elaborate on later, will serve as a guide throughout the discussion. 
(Refer to the figure on page 168.) Part of this structure deals with the four phases of 
teacher education. In Selection, the first phase, there is a choice-point — here one 
can choose to be "unselective," within some limits, or selective. From this point, 
one usually enters a training phase for which there are three options. Most people 
experience "knowledge training*' — that is, they engage in a symbolic or verbal 
activity — in their teacher education programs- They may also engage in skill 
development which in our program and the Stanford University system is fre- 
quently referred to as ^'simulation." During this process, the trainee builds up 
classroom skills to a functional but not polished level. In field experience, the third 
option, the trainee integrates knowledge and skills so that his behavior is more 
polished, albeit not perfect. 



Overview 
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After the training - or in the latter part of the training, certification becomes 
another choice-point, along with the placement of the teacher. This choice is of 
initial importance, especially to teachers, because of the different teaching contexts 
in which they may find themselves in a pluralistic society. The schools they enter, 
whether urban or rural, inner city or suburban, working class or middle class, will 
influence the effectiveness of their skills. McDonald has taken this matter into 
account to a considerable degree in his study. 

(Kditors* Note: Dr. Turner here makes reference to a presentation at the 
CUNY-UFT Conference by Dr. Frederick McDonald.) 

"Utilization," a term now used at the National Institute of f'ducat: n, is 
conjoined to placement, but focuses specifically on how the talents are used in a 
particular position. In Indiana, it is not uncommon for a teacher to have a majt)r 
certification field and yet teach in a field in which he or she is minimally qualified. 
For example, an athletics coach might be teaching history or mathematics. A major 
issue in certification, then, is the job you are certifying someone to do. Will a 
teacher who has been certified primarily to coach be competent to do everything 
else that coaches might be asked to do in a small scht)t)l? 

The final phase deals with wt)rk success criteria. We can look at that in several 
different ways. When McDonald talked about teacher performance, he meant 
process criteria - acts which the teacher performs, the interactions of teachers and 
pupils. He also discussed student performance, which meant student achievement, 
as one kind of product criterion for teacher performance. We have done st)me 
studies on other work success criteria, one of which relates to student appraisals or 
reports of teachers - that is, student evaluation of instruction. I consider this a 
product criterion becau.se what students think of their instruction is an tnitcome t)f 
that instruction. 

A third product criterion is "professional judgment" or the professional reputa- 
tion of a teacher among the administrative personnel, the teacher's peers, a college 
supervisor, or anyone who can render a professional judgment about teaching. The 
reputation that a teacher holds, in other words, is another product of his or her 
teaching and is not just a process outcome, as many people believe. This reputation 
may result in part from the duties teachers perform in the school other than 
classroom teaching. 

Under Program Options as Policies (H), one has a choice of placing different 
kinds of emphases on the options. For example, "non-selection" could be empha- 
sized under Selection, simulated skills under *rraining, and so forth. Later I will 
discuss some of the research data that relates to each of these choices together with 
the different ways we can do research studies bearing on the choices. 

At this point I might note that there are two basic research strategies. The first 
one, Development-Kvaluation Research Studies (D), may be most easily described 
here as making predictions about program t)utcomes. Let us suppose that we are 
running two programs. In one, we select the students on the basis of some criterion, 
and in the other, we have not selected the students. After the completion of these 
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twi) pnigrams, wc might stiuly the inuTiciliatf outcomes, l)ut implicitly wc must 
make a prediction .ihi)Ut the success of the unselectetl versus tlie selectcil people in 
their later work. 

In another stuily, ue might introtlucc a heavy knowleilgc emphasis into a 
program, together with a low skill emphasis, anii look at the outcomes of that 
program versus one uliich reverses the emphasis. In hoth cases, the researchers 
would be developing the treatments first in ortler to firul out the consequences. 
This type of research involves follow-up stittlies anil has some of the qualities of 
experimental ficKI stuilies. In both stuiiics mcntioncii, the researciicr must implic- 
itly predict that making a change in current practice will iiavegooii outcomes. 

riie second strategy, I'icKi Research in Tcaciicr Beiiavior anii Irffectiveness, 
which McDonalil illustratcii in his liiscussion, primarily mvolves the relationships 
between process criteria and proiiuc't criteria. Ihcse studies a nil there have been a 
fair number of them make the assumption that if the skills nt)w seen in o[)er<ition 
were to be taught to the teacher in pre-scr\icc training, these skills woubl be 
effectively implementeii at a later time. Let us suppose, for example, that we have 
assesscil the behavior or skills nf many thini grade teachers ami finil positive 
correlations between certain skills ami selected pu[)il outcomes, b'rom these results, 
for purposes of teacher eilucation, we must now assume that if teachers were 
trained in these skills at the prc-scr\ice level the skills-as-traincil wouKI be correctly 
exercise I later atui that positive pupil outcomes wmild follow. Tins <issumptiim, 
however, is rarelv tested. 

rluis, there is a gap in this strategy, aiul there is not much eviilencc that wt* can 
successfully jump over it. B\ measuring aptituiles, tlic MeDonaKI stuily is buihling 
in some protection, because afitituilcs are more stable than specific behaviijrs ur 
skills, rhe inductive and deductive abilities ami vocabularies of irulividuals change 
less than their ilay-t<)-da> or \ car-to-year classroom behaviors. Nonetheless, the 
study must make an inferential leap from measures of tcaclicr behavior in a 
classrt>om at any one pc>im back tt) the pio[)ositioTi that those behaviors are 
trainable and w ill later be used in the same form ni which they were traincil. 



Now that wc have an overview of the structure, I want to return arul exanune 
some of the research bases connected to the two options unilcr Selection. We can 
cast the first c)ption in question form: If admission iruo a program is unselective, 
how will things tend to turn out? One outcome is thai there will be more girls in 
the program than boys. In addition, the girls will sliow more commitment lo 
teaching than the boys will, (iirls also tend to clioose at an earlier age to tcai'h in 
elementary school ( Bal/cr, P>f)f)). I bcv apparently have dcculcil that this role fits 
them ami have iMrrieil that ulea or goal thrc)ugh into a teaciier education program. 
I his development, by tlie wa> , is an interesting aspect of sex role actjuisition, which 
we should study in greater deptii. 

A second c)utcomc shows tliat girls who enter secoiularv schof)! teai hing tciul to 
niakc this choice iliinng then high school years. IJoys gcnerallv do not chof)se this 
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course at all. They tciui ti) delay their cDinmitment to teaching a long time, ami as 
some studies have shown, frequently until placement time (Perry, 1972). To males, 
teaching often represents an alternative which they see as perhaps leading to other 
kinds of positions or jobs at a later time. Teaching is a career which boys frequently 
believe they should choose not to enter if they can avoui it, or can i)pt out of if 
they want to. Thus, present studies suggest that there is much less commitment to 
the profession among males than there is among females. 

.•\ study conducted at Louisiana State University (Ducharme, 1970) revealed 
positive outcomes fi)r teachers who had had voluntary experiences working with 
children before they entereii their preparatory program, .\iiministrative officers in 
the schools rated these teachers higher than those without such experience, thus 
demonstrating a positive CDrrelation between early teacher work experiences and 
subsequent administrative ratings. SecondK . these teachers tended to get somewhat 
better achievement gains from their stuiients than those not having such experience. 
These findings probabh relate ti) early interests of the teacher, who has been 
practicing some form of the teaching role l)efore entering a preparatory program. 
We shouKi look at these t)utcomes more closely. I am not sure that asking somet)ne 
tt) get this early expenence wduM produce the same effect, If a program requires 
early work experience with children, then everybi)tiy will have it whether they like 
it or nt)t. ami no correLitii)n will exist. 

A researcher at the United States International University (Baker, 1970) also 
conducted a study bearing on Selection. In it students were given intake interviews 
during which they were rateti for potential suvcess. After these students entered the 
field as teachers, the researcher askcil their supervisors or principals to rate them. 
The outcomes indicated a correlation between the intake interview ratings of the 
students as freshmen and the subsequent ratings of them as teachers by the 
administrative officials. Tiiis leads me to believe that there ma\' be some consis- 
tency in professional judgment. That is, if a committee of professionals handled the 
selection operation for a teacher training program, they might be able tt) pick 
people who would subsequenth' be seen as successful teachers by other profes- 
sit)nals. 

A study conducted in Brooklyn by Shirley Weeden (1972) beginning in the 
mid-1 96(iVs showed, however, that out of the hundred or so students dismissed 
from the Brooklyn preparatory program over a period of four or five years, about 
eighty percent found their wa\ into teaching one way or another. It seems clear 
from this study that one cannot easily stop up the holes in the teacher training 
system. If New York State programs eliminated all of the trainees they believed 
would be unsuccessful, these trainees would probably go to some other state and 
get their teaching certificates there. Fliminating students at certification time may 
or may not be more effective. Again, the students may come in by the back door, 
and the state will again be l)ack to the certificatit)n problem it thinks it has solved. 

A number of currently used measures are not ver\ gootl prcilictors of a person's 
Work success, If given early, achievement tests, graile point averages, and other 
intellective measures usually dt) not predict work success (Ducharme, 1970, Baker, 
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1970), ;iltlu)iigh there is some eontrary evidence {Bai/.er, 1966). In the Balzer study- 
data, girls who hail iJeeiiled earh' to teaeh sehool anti had entered teaehers colleges 
turned out to be superior, on seven of the eight variables measured, to girls who had 
dcciiled much later to teach. These variables were predominantly intelleetive items 
such as vocabulary, language ability, ami niathcinatics. However, the study did not 
follow these girls far enough to tell us whether or not they made better teachers. 
.\lthough we all engage in some kiml of screening-out procedures - for example, by 
not admitting students with below C averages into the program, we do not get very 
good predictions with the ones accepted above that level. MeDonald, however, may 
get better results in his study because his measures are more specific than grade 
point averages or general achievement tests. 

One variable we looked at at Indiana seems promising (Yoder, 1967, Turner, 
1967). It is called the "Organized, Business-I.ike Behavior" versus "Slipshod, 
Irresponsible Behavior,'* and appears in scales originally lieveloped by I). G. Ryans. 
\Vc tested tramees on these scales prior to stuilent teaching and then looked at 
those who had discipline problems during their field experience. 

rhe results of this stuiiy indicated that the student teaehers who scored high in 
organized, business-like behavior had signifieantly fewer discipline problems than 
those who scored low, i.e. were slipshoii and less responsible. We also found the 
same results in a predictive study of first-year teachers with the more business-like 
teachers having substantially fewer discipline problems during the first year. 

It would be possible. I think, to generate some eriteria by whieh selection into a 
program can be controlled. I'inaneially speaking, there are sound reasons to do so. 
Let us take, for example, a student who has gone through three years of a program, 
but is eliminated in the fourth. Whether we like it or not, we have run up that 
student's costs and probably affected the rate of return on his education. It is 
certainly more humane to eliminate people relatively early in the program than to 
allow them to continue and then eliminate them at the end. In any event, I hope 
that in performance based teacher educatit)n we will think about the possibility of 
selecting potentially successful people rather than placing all of the emphasis on 
training. 



Training 

Framing, as an activity of Development and Kvaluation Research Studies, is a 
process that brings about an outcome. In a program which emphasizes knowledge, 
we need to know whether the level of knowledge a student attains as an outcome of 
his program is in fact a good predictor of teacher success. Neither college or high 
school grade point averages are good predictors. One study (Baker, 1970) found 
that although college (iPA's correlated with student teaching grades, they did not 
correlate with work success ratings by professional judges. Standardized achieve- 
ment tests given to teachers seem to be non-predictors of the attainments of 
teachers. Paper and pencil tests, such as final examinations in methods courses, also 
usually do not predict teacher success. 
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Problem-solving tests used to measure teacher knowledge do show some relation- 
ships to outcomes, but they have to be done in a particular way. A performance 
test which I helped develop in the early 60's contained one section which presented 
examples of pupil work from which teachers had to diagnose pupil difficulties. In 
another section, we asked teachers to organize a group of elementary arithmetic 
problems in thv order of their difficulty for the pupils. Among the various test 
items we used, the teacher's ability to diagnose student difficulties and to cirgani/.e 
materials turned out to be one of the best correlates of arithmetic gain in the 
classroom (Turner, 1964). Sequencing ability, or an ability to organize material 
hierarchically also turns out to be an aspect of knowledge training which is a 
reasonable predictor of teaching success (Trojcak, 1969). Therefore, the point here 
is not whether teacher knowledge correlates with student learning, but how we get 
at the knowledge of the teacher as a predictor of success. A teacher's ability to use 
knowledge to diagnose pupil problems and to sequence materials seems to correlate 
with student outcomes. 

The training and protocol movement offers a third way for developing knowl- 
edge. Instructors within this movement try to teach a concept to trainees by 
showing its instances in the classroom, and then they evaluate whether the trainee 
has learned the concept in an applicable way. For example, a trainee would learn to 
recognize instances of a particular kind of behavior from a film showing real 
classroom situations. Such a film might depict examples of positive or negative 
reinforcement, .shaping, or any of a wide variety of concepts. Protocol materials, 
then, stress that the trainee apply rather than merely verbalize knowledge.* 

Trainees, as they viewed this film would code approval or disapproval and praise 
or rejection behaviors. This kind of training treatment provides a better test of the 
knowledge of concepts used in teaching than most of the tests we now have. 

We followed the u.se of protocol films with training materials which asked the 
teacher to produce the behavior that the concept symbolized. That is, the teacher 
would actually have to demonstrate behaviors depicting approval, praise, reinforce- 
ment, criticism, and rejection. Through this method we can test the level of skill a 
trainee has before he or she goes into cla.ssroom situations. It is accurate to say that 
we can increase that person's skill level prior to classnjom performance, but final 
polishing of the skill requires classroom practice. 

In a research seminar I conduct on the assessment of teacher performance, one 
of my students raised the question, "What are the characteristics of approval 
behavior, and how can we array those characteristics on another rating scale to 
show the quality of approval or disapproval behavior?" This student will spend 
about a year working on this problem. Finding solutions to such questions requires 
tremendous ai^iounts of time and money. The McDonald study is trying to find out 
whether such actions as approval or disapproval make any difference. To do so, 

*At this point in the presentation. Dr. Turner ran a film dip showing hricf instances of 
classroom situations in which teachers demonstrated the behaviors or concepts of approval and 
disapproval, praise and rejection. He explained beforehand, however, that these concepts had 
not been proven to In highly effective - that is, "approvaP' on the one hand or "disapproval" 
on the other did not invariably correlate with pupil outcomes. 
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these researchers will have to develop a special scale which includes all those 
attributes showing the quality of approval - disapproval and correlate that to pupil 
attitudes, achievement, and other outcomes. At the present time, Performance- 
Jiased Teacher Kdueation forces us to a*;sinne the answer to such questions, and I 
do not think we can now assert what the final research outcome will he. 

Jvxperimentai studies (Hiller, 1972) have shown that teacher vagueness is a 
characteristic which correlates, in the knowledge domain, to pupil outcomes. 
Vagueness, for example, can he controlled in a speaker hy depriving the speaker of 
information. If one provides trainees with only a small amount of infcrmation on a 
subject and asks them to give a speech on that topic, they will, of necessity, have to 
use a pretty vague approach simply because tiiey do not know very much. But if 
one gives them more information, their speech will be less vague. V'agueness. then, 
is partly a function of the amount of knowledge a person has. I do not want to 
become excessively political here, but vagueness ratings of the Nixon-Kennedy 
debate showed that Nixon was more vague than Kennedy. Witiiout carrying the 
matter too far, I will just add that it is of course possible to be vague for reasons 
other than a lack of knowledge. 

McDonald, together with others, has created at Stanford University one of the 
very good skill-development systems. Altiiough he is more expert in this area tiian I 
am» I will take the liberty to briefly discuss some of the things lie has done. To use 
his expression, this explanation is accurate but not necessarily precise. In this 
system, the trainee has a video tape made of his or her own performance in a 
mieroteaching situation. The instructor views this tape to see what the teacher, 
without training, can do. The instructor then shows the trainee a model of a teacher 
engaging in a specific kind of behavior that the trainee should be able to perform, 
such as praise or approval, or asking higher order questions. 'I' he trainee analyzes 
and identifies the components of that model teacher's behavior and compares it to 
his or her own videotaped performance. Then the trainee teaches again and the 
istruetor observes the new performance on videotape and appraises the extent to 
which the trainee has improved in the skill since the pre-test entry point, or the first 
videotape, as the result of the interim training. In this case, the instructor uses a 
modeling treatment in which the student sees a teacher do sometliing and then 
imitates that behavior. 

'I'hese systems, which demonstrate an emphasis on skill development under 
simulated conditions, can clearly produce again in skills without the trainee having 
direct field experience. Altiiough I am neutral on the question of performance- 
based teacher education, I feel that if we are going to make progress in the teacher 
education domain, we should place substantially greater emphasis on skill develop- 
ment in undergraduate training. Our present programs tend to be based too much 
on the verbalization of knowledge and not enough on the expression of knowledge 
in operable skills. A modest alteration m the programs along these lines would 
surely lead to a great deal of improvement in teacher education. 

The question of field experience is more complicated. We have a tendency to 
think that field experience is always beneficial. This morning, one person in our 
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audience suggested that the practitioner ought to be doing all the training of 
teachers. There is some merit to this» hut it is not entirely clear exactly what 
outcomes we would get unless we did it with a great deal of care. E'or instance^ 
there is not much evidence to support the widely held assumption that teaching 
experience leads to increased skills. The analysis of the evidence we picked up in 
the 6()*s shoves that the curve for skill acquisition in the teaching of reading and 
arithmetic rises very rapidly during student teaching, when the trainee is under 
supervision. It also rises rapidly the first two full years of experience, hut here the 
gain is a function of the amount of supervision the teacher gets ( I'urner, 1965). By 
the amount of supervision, I mean the number of quarter-hours of contact super- 
visory personnel spend with new teachers. 'I'hese personnel would he trained 
supervisors in reading or mathematics who know their areas thoroughU' and make 
their living in this job but interestingly enough, typically not principals. In the 
studies I am referring to, supervision tends to be the chief variable in explaining 
skill acquisition. If the teachers are not supervised, they apparently are not able to 
increase their skills very nurch. 

Most teachers acquire the skills they are going to utilize by the fifth year and 
from then on, they probably practice what is most comfortable. There is an 
increment only if there is some kind of intervention. For example, one of our 
faculty members has developed an in-service intervention called **mastery learning** 
(Okey. 1972). This instructor provides the teachers with training booklets that are 
designed so teachers can complete each section at home in about one-half hour each 
(between TV programs). Okey has found that teachers do not respond favorably to 
these materials if the sections are longer, if only for the reason that the sections 
have to be worked in between other duties. The sections require the teachers to 
form objectives for their instruction, make diagnoses, prescribe further instruction 
as a consequence of diagnosis, collect their materials, engage in actual instruction, 
do remediation, and evaluate. In other words, the instructor is teaching each of 
these skills step-by-step to teachers. The inservice teacher then goes into the 
classroom and practices each of these steps or skills with a group of pupils. 

Okey has used sceral kinds of evaluation designs with this mastery learning 
system. In one, he has had teachers divide their students into two groups, one of 
which is taught by th * newly acquired teaching procedure, while the other group is 
taught by the teachri*s usual procedures. The two groups are then compared. In 
another design, the teacher uses the mastery system in teaching one unit of 
instruction, but not in the next unit he or she teaches. For example, if the topic is 
fractions, the teacher will use the particular skill or skills to teach a unit on simple 
fractions, l)Ut not use the system to teach a more complex set of fractions. The 
outcomes of the two units are then compared. As a consequence of this interven- 
tion system, even with teachers who have been out in the field for fifteen years, 
student performance increases. 

A field -based program, to be effective, must include resources for enough 
supervisors to have very close control over what happens during the field experi- 
ences of trainees. The program shturld not simply throw the trainees into the field 
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with ;i number of teachers and expect the trainees U) develop a solid set of teaching 
skills. 

One ot the concerns ot many people about performance-based teacher education 
IS the amount of emphasis the instructor shouUi place on pedagogical skill develop- 
ment and field experience. A variety of program options, each with different kinds 
of emphases or levels oi emphasis, is one way to meet this ci)ncern. At Indiana 
University we have instituted alternative teacher educatii)n pri)grams, each one 
different from the other. We have a multicultural pri)gram with a heavy field 
component m which, for example, trainees are working on Indiana reservations, 
teaching the children ami learning abi)Ut Indian culture. In ani)ther nu)re traiiitional 
pri^gram, the trainees receive heavily km)\vleiige-laden training after which they do 
student teaching. A third program emphasizes Proti)col film and simulation ma- 
terial, and after training, conducts some performance testing. Judith Henderson's 
program at Michigan State also has a heavy element of simulatii)n training related to 
skill acquisitii)n. There is no one way to judge which of the three emphases is best. 
l\*rformance-based programs should reflect a ^ood ileal of diversity in order to 
optimize specific skdl. knowledge ami attitude outcomes anu)ng trainees who vary 
in aptuude and in personal characteristics. 

Certification 

In certification, '^program approval" is one optii)n. It is the traditional i)ptii)n. 
The chief difficulty with this option is that one eanni)t be eertain how much skill a 
particular individual actually has when he or she ci)mes out of ci)llege. Can the 
perst)n perform well in an actual classroom? The trainee will not have had very 
much classroom experienee perhaps only eight to twelve weeks of part-time work 
during student teaching, We could hardly expect anyone to acquire very high levels 
of skills during such a short period. But with PBTH program approval, which New 
York State already has, programs will probably be able to increase the ami)unt of 
quality control ove»- programs and thus speak more directly to questions ot 
individual performance. Nonetheless, at the moment there is a poor research and 
development basis for such programs. People who specify that certain skills should 
be acquired are going beyond current knowledge since they cannot demonstrate 
that those skills make any difference in teaching. Initially at least, they are just 
hoping that they will. The development-evaluation paradigm which I mentioned 
earlier (see line D in the figure) will, however, permit teacher educators to test their 
hopes against actual evaluation data. 

We should be very cautious about making promises about the delivery of even a 
•^Program Approval" option in PBTK. It will probably take over ten years to find 
out whether PBTK is (or was) worth the investment. Only then will we have a 
chance to see a large number of trained teachers in different types of performance- 
based pn)gi*ams functioning in the field. 

Intiividual approval is the other certification option avadable. A certification 
l)oard wouM in some manner test an individual at a particular time on a perfor- 
mance basis and decide whether or not that person demonstrates adequate perfor- 
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mance. It is, however, very difficult for me to imagine a testing center comprehen- 
sive enough to give a fair test to an individual performer over a short period of time. 
Recently, when I was in Syracuse, some people there were discussing the possibility 
of a teacher counseling and improvement center. Here, after different kinds of 
measures have been used to assess and diagnose what a teacher can do, the teacher 
would tr\' to improve those skills needing attention. This approach is different from 
attempting to certify someone by testing them at a skill center since it diagnoses 
deficiencies and attempts to correct them rather than to evaluate them for purposes 
of certification. In all, state approval of performance-based teacher education 
programs, is probably the least noxious option in this domain. 

Work Saccess Criteria — Process Skills 

Applying process criteria involves observing the acts of the teacher and judging 
whether those acts are performed at an adequate level. The problem with process 
criteria, even if we find correlations between teacher performance and pupil 
performance, is the appropriate or optimum level of teacher skill performance. In a 
performance-based program, we must ask what degree of skill or level of behavior 
somebody has to engage in to be considered skillful. But how are we going to judge 
this from correlational studies? It is a question of the standards we apply. We 
ultimately cannot avoid making judgments against these standards. The problem is 
that research studies do not tell us the appropriate standards or levels for specific 
skills. As a matter of fact, there is reasonable evidence that some behaviors are best 
practiced in the middle range - not too much or too little. 

It might be thought that if we greatly emphasized a particular behavior or skill, 
we would get more achievement gain from students because the relationship is seen 
monotonic and linear, always increasing. However, a behavior might have bad 
effects if over-emphasized. A study by Robert Soar suggests that teachers who 
allow too much physical freedom in the classroom get unfavc -able results, whereas 
teachers who do not allow enough physical freedom in the classroom also get 
unfavorable results (Soar, 1973). The question is, how much is enough? I cannot 
answer that question. I am not even sure I know how to answer it. New York State, 
however, is certainly going to be faced with questions of this kind within the next 
year or so in one form or another. What are the standards to be applied to teacher 
performance? 

Product Criteria 

As for product criteria, McD >nald has already presented some factors he has 
examined in pupil achievement. I would like to add to that some points on 
professional judgment and student reports. We obtained some rather surprising 
results from a study we conducted at Indiana in first-year French courses. For two 
years, we looked at all of the teachers in beginning French - eighteen one year and 
twenty-four the next year. In these courses, we measured student residual 
gain - the amount of learning as evidenced on the final exam corrected for the 
initial or entry level of learning. We also looked at the performance on the mid-term 
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test and the third-quarter test. We then had the students rate the teachers at the end 
of the course. We found that teachers who were rated high had less student 
achievement (Turner and Thompson, 1974). In this context, this finding suggests 
that a program which has as criteria both pupil achievement data and pupil report 
data about teachers will run into problems because the two criteria are negatively 
related. We also found the largest factor in the rating is how well-liked and how 
warm the teacher was. In general, the warmer a beginning French teacher is, the 
worse his students do. 

These results probably depend very much on the distribution of student ability 
in learning French. We obtained stronger relationships in the first year when we had 
all the students rate their teachers. We lost students the second year, and the 
relationships were a little weaker, but still significant. In the second semester of 
French, we had lost about fifty percent of the students because it seems that the 
students were not disposed to take the next semester, at least immediately. They 
may have learned a good deal the first semester, but they apparently did not like it. 
In any event, the correlations for second semester were still weaker than those for 
first semester. A drop-off rate in a course may be inconsequential in public school, 
but it is not in colleges which have tuition-paying students. A French Department 
determines its number of faculty positions in considerable part by how many 
student credit-hours it has. Clearly it has a problem when it loses fifty percent of its 
students between the first and the second semesters. The department is thus faced 
with a bad set of choices — shall it lose enrollment because of unpopularity in its 
teaching or shall it go for increased student gain? These are difficult choices to 
make. 

I want to add here that in public schools it is indeed serious what students report 
about their teachers or their schools. Bad reports reflect on bond issues, tax levies, 
and other matters that duly influence the financial conditions of a school system. 
The relationship, however, is not as direct as in colleges where tuition income is 
directly involved. 

We have some other results that are tied to the relationship between process 
criteria and product criteria. McDonald's study is going to produce some of these 
relationships; however, there have been a good many studies in the field, and I 
doubt that his study will be conclusive. One of these variables is ^'indirectness," 
which in the Flanders system means the frequency with which the teacher accepts 
pupil feeling, approves pupil behavior or gives approving responses, and responds to 
or expands on student questions. In general, "-ndircctness" is a positiA e response to 
students. 

Another variable concerns the amount of criticism and the amount of self-justifi- 
cation that a teacher goes through. "Indirectness," or the first three approval 
categories, is placed over the criticism and self-justification categories to give the i/d 
ratio. Using this ratio, researchers try to predict which teachers obtain the better 
learning from students. The results of these studies, as reviewed by Dunkin and 
Biddle (1974), indicate that in nine out of eleven studies, pupils liked the indirect 
teacher better than the direct teacher, while in 33 studies relating i/d ratio to 
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student achievement, fifteen studies showed a positive relationship, fifteen showed 
no relationship, and three indicated a complicated kind of relationship. Should 
instructors train teachers to be indirect? Maybe it helps and maybe it doesn't. 
Insofar as praise and approval are concerned, one study of those teachers who gave 
more approval showed better pupil attitudes and four did not show any relation- 
ship. With respect to student achievement and the amount of approval, three 
studies produced positive relationships and eleven showed no relationships at all. In 
studies of the relationship between acceptance of pupil ideas and student attitudes, 
three studies resulted in positive relationships and three showed no relationships. 
With respect to the relationship between the acceptance of student ideas and pupil 
achievement, one study was positive, and five studies showed no relationship. With 
respect to teacher criticism of students, a recent study by Brophy and Kvertson 
indicated that middle class students learn more under critical teachers, while lower 
class students learn less. 

These studies are bothersome from the viewpoint of PBTE because they deal 
with well-researched variables, and yet, pupil outcomes cannot be consistently 
predicted from them. We really do not know what kind of teacher behavior 
consistently relates to pupil achievement outcomes. Should we certify a person who 
is indirect and not certify the one wht) is direct? There is insufficient evidence to 
support advocating a particular set of skills in this domain. 

Data on classroom management rests largely on the work of Jacob Kounin 
(1970), but unfortunately there has been little replication of his work. In classroom 
management, Kounin identified a variable called "withitness,'* which means the 
extent to which a teacher can keep several things under control in the classroom at 
the same time. Teachers working with small groups would be able to monitor, say, 
the reading group with their right eye and all the pupils doing seat work with their 
left eye, and with the eye in the back of the head still be able to see what is going 
on in the rest of the room. In the work done by Kounin, "withitness*' was 
correlated with another variable called "overlap," a behavior in which the teacher is 
able to do two or mor * things at once. The degree to which a teacher possesses this 
skill is a key factor in whether the pupils in the class are on a task or show deviant 
or off-task behavior. For overlap to be a relevant variable, however, we first have to 
assume that the particular task assigned to the pupils or the task they themselves 
have undertaken is important behavior. If we agree on that, then the amount of 
overlap the teacher shows will be an important correlate of the on-task behavior of 
the pupils. 

Using a test built by Schumm (1971) on "overlap*' some fairly interesting results 
with student teachers were obtained. The following is an example of an item in this 
test. A slide showing children in a classroom situation with their hands up is briefly 
presented. At the same time, the school principal reads an announcement over the 
loudspeaker. Afterwards, the teacher is instructed to write down both how many 
children had been pictured with their hands up and what the principal had said. 
Interestingly, the results of this test predicted that the elementary school student 
teachers who had more overlap had fewer discipline problems. The results also 
indicated that teachers who liked to work with multiple small groups rather than a 
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single large group scored higher in overlap behavior. However, this study showed a 
peculiar correlation to the studyhall behavior of teachers - the less overlap they 
had, the better off they were in supervising students in studyhall. This happens, 
perhaps, because studyhalls are a unidimensional activity. Teachers in studyhall do 
not have very many things to manage simultaneously. 

Other important characteristics of classroom management include "smoothness" 
or the flow of teacher behavior, group learning in which the teacher makes sure the 
group knows what he or she wants them to do, and accountability, or making kids 
responsible for the consequences of their own behavior. Valence or challenge 
arousal — or the ability to stimulate enthusiasm and keep pupils involved is another 
important behavioral variable. This behavior, which is called stimulus variation in 
the Stanford system, requires the teacher to use a variety of materials and many 
different activities to keep the pupils' attention on what they should learn or what 
the teacher is saying or doing. 

Turning now to behavior modification, there are several highly specialized 
studies which support the notion of targeting reinforcement to get a particular 
behavior. Let us suppose I have targeted a class of behaviors that I want to change 
in someone. For example, I want to get a pupil I'll call Jane to smile more often, so 
I would address positive reinforcement to that behavior. Everytime Jane smiles, I 
would say, "That's good, Jane" or perhaps just smile back. Pretty soon, Jane smiles 
more. But if Jane raises her left eyebrow, I do not say, "That's good," I don't say 
anything. Targeted praise is probably a more effective positive reinforcement than 
general praise. 

The best systems for determining a teacher's positive and negative responses are 
the rapid-marking type systems, as in Flanders' CIA, where the observer rates a 
response once every three seconds, and the time-sampling scales in which the 
observer looks at teacher behavior and then checks the appropriate categories once 
every five to ten minutes. 

Incidentally, the question of the reliability of observation is best handled by 
inter-observer agreement. For example, if you have three observers in the room 
checking the frequency of smiling, and they all produce the same rating, then we 
have what is technically known as "objectivity" in that behavior, as validated by 
inter-rater agreement. If only one person observes the behavior, we can assume that 
this observer's perception is subjective and cannot be verified. As the McDonald 
study indicates, inter-observer reliability is most easily done by multiple observers 
viewing a videotape of a teacher. 

The number of different teacher behaviors and skills which can be listed in an 
observation schedule or category system is, of course, very large. I have noted above 
only a few, predominantly those with some research history behind them. The 
broad question facing the creators of performance-based teacher education pro- 
grams is to find a way to select the skills to be included in the program when only a 
few have at this time been shown to have any claim to validity. In constructing the 
General Catalog of Teaching Skills we tried to confront this question squarely. Our 
resolution of it is described below. 
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A Differentiated Approach to Teaching Skills 

The question of identifying teacher skills functional in a pluralistic society, and 
also observations of what goes on in schools in a wide variety of contexts, led us 
not to specify a single undifferentiated list of skills in constructing the General 
Catalog. Rather, we first divided the catalogue according to subject matters, adding 
separate sections for "Socialization and Classroom Management" and for "Early 
Childhood," which is an age level and not really a subject matter. As a second step, 
we used Bruce Joyce's idea (Joyce & Weil, 1972) that within subject matters 
teachers use different approaches to or different models of instruction. Here we 
tried to reflect schools of thought within different subject matters. For example, 
some segments of society believe that corporal punishment is moral, and others, 
that it is immoral. As researchers, we are confronted with these two opposing value 
systems. In the catalog, we tried to represent accurately a pluralistic system in a 
fairly objective way. 

In the chapter dealing with "Socialization and Classroom Management," we 
came up with four schools of thought, one of which we called the authoritative 
approach. For teachers who take this approach, we included skills in administering 
corporal punishment, with the idea that if pupils are to receive corporal punish- 
ment, the teacher ought to give it in a skillful way. Many other skills related to 
"keeping discipline" are, of course, included. 

The second approach, social-developmental, which has a long history in this 
country, and is identified primarily with Progressive Education, stands in substan- 
tial contrast to the authoritative approach. The cognitive developmental approach, 
the third approach employed, is derived from Piaget and Kohlberg, and deals with 
the open-classroom style of management and socialization. The fourth school ot 
thought is the non-directive, or represents the Carl Rogers-Virginia Axline, 
approach. The skills to be acquired under this approach are very, very different 
from those identified with the authoritative approach. 

Within a subject matter, a teacher deals with several functions including goal- 
setting, arranging the environment and selecting materials. Many studies have 
focused on another function - interaction or face-to-face instruction. However, in a 
system Lawrence Stolurow invented several years ago, there was substantial empha- 
sis on what he called the pre-tutorial or planning behaviors - materials selection, 
programming behaviors, and so forth - that go into teaching, and less emphasis on 
the interactive aspect. Thus we identified these pre-tutorial functions as involving 
important skills which are frequently overlooked. Diagnosis and Evaluation are also 
functions of teachers. Most teachers assign grades; they also try to figure out why 
certain of their pupils are not doing as well as they should be doing. After 
considering these various functions of teachers, we decided to divide all approaches 
within subject matters according to the type of skills needed to perform each 
function. 

Next we divided the skills according to the types of environments in which they 
could be observed. One type of environment is the college classroom and learning 
laboratory. The other context is micro-teaching or the actual classroom. Finally we 



181 



listed indicators for each skill under those contexts. For example, a skill under 
"Non-Directive Teaching" is "establishes and maintains rapport with students." Let 
us take a look at how we might test that in the college classroom or learning lab by 
looking at certain indicators. We could give the trainee a transcript of teacher-pupil 
interaction and a form on which he or she could code teacher talk as rapport- 
building versus threatening or rejecting. Or, we could show the trainee a film or 
videotape of teacher-pupil interaction, after which he or she would code the same 
behaviors using I' landers' categories U 2, 3, 6 and 7. In microteaching or in an 
actual classroom, our list of indicators for "Kstablishing and Maintaining Kapport" 
would include: greeting individual students in a relaxed way, complimenting or 
praising students, making humorous, non-threatening remarks, soliciting student 
opinion or comments, listening to student observations or opinions and encouraging 
expansion, making observations or comments which indicate acceptability of stu- 
dent expression; and, avoiding the following behaviors: constantly passing judgment 
on student behaviors, dress, fric ' ; or family, derogating or criticizing students, 
frowning, constantly carping, and being physically withdrawn or aloof. Some of 
these behaviors are positive indicators of the skill that we are building and some are 
negative indicators. Trainees can code these actions, either from video-tape, to show 
that they understand the elements of the skill we are building, or from a direct 
observation, to determine whether the teacher has the rapport-building skills. 

If we decide to measure a skill like "Kstablishes Rapport," we must clearly 
identify the behaviors that make up or indicate that skill. How would 1 know, for 
example, that a teacher is enthusiastic, or that be or she can select appropriate 
materials? Unless we define and agree on the same behaviors in a skill category, we 
can never establish communication. When I search for the behaviors that constitute 
a particular skill. I am not attempting to control these behaviors, I am only trying 
to identify their components. In a pluralistic system, the programs, organizers or 
the instructors make the value judgments, Fhe researcher's job is to develop the 
materials that permit the instructors to express their own values. 

The General Catalog contains about fifty skills per section, but these behaviors 
are not independent of each other. When we take functions across subject matters, 
we get a repetition of skills, but in a different form. A question in teaching math is 
whether the teacher can evaluate certain kinds of mathematics behavior in children. 
We repeat the question in a different form for Knglish teaching. In other words, 
these are similar behaviors with different content. A teacher might be appraised as 
skillful in evaluating math performance and incompetent in evaluating performance 
in Knglish. Vet, there is a commonality in these teacher skills because both involve 
some kind of evaluative behavior. In essence, the situation appears to be this: the 
functions performed by teachers such as goal setting, arranging the learning environ- 
ment, interacting witli pupils, or diagnosing and evaluating are general or cut across 
subjects and age levels. Teaching skills, however, are specific since they involve 
opcratmg prcciseh within a subject matter while instructing a particular level of 
student. 1 1ms, all teachers perform the same general functions, but the skills they 
exercise may be viewed as specialized according to what is taught and who is being 
taught. 
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Under the concepts used to organize the General Catalog, the question of the 
general definition of teacher skillfuhiess can be resolved by noting that the less 
skillful teacher has control over only a few skills under a single approach to 
teaching a specific sut)ject or a specific age level. A highly skillful teacher, on the 
other hand, is one v^ho has mastered all of the skills necessary to perform each of 
several approaches within the subject matter taught. For example, a highly skilled 
social studies teacher is not merely skilled in the ''assignment-recitation" approach 
to teaching social studies, but also has full control over the social inquiry approach, 
the social participation approach and other approaches to instruction useful as 
teaching contexts vary. 



T^*acher education has a definite structure which incorporates selection, training, 
certification and placement, and finally, work success. At each point in this 
structure, several program options or choices may be identified. The research base 
for selecting one option over another is modest, although some pertinent informa- 
tion is available from research in teacher education and teacher behavior during the 
past 20 years. The urgent question of which teaching skills and antecedent knowl- 
edges might best be incorpoiated into teacher education programs cannot be 
answered very well on the basis of the available research information. Vor this 
reason, the alternate approach of the General Catalog of Teaching Skills, which is 
based on approaches to, or models of, teaching, is proposed as one workable way to 
resolve this question. 



Summary 
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